Clustering Sentence-Level Text Using a Fuzzy Back- Propagation Clustering Algorithm
نویسندگان
چکیده
In comparison with hard clustering methods, in which a pattern belongs to a unique cluster, clustering algorithms with fuzziness allow patterns with differing degrees of membership to belong to all clusters. This is important in domains such as sentence clustering, as a sentence may belong to more than a topic present within a document or set of documents. Since most sentence similarity measures do not represent sentences in a common metric space, traditional fuzzy clustering approaches are generally not applicable to sentence clustering. This paper presents a back propagation fuzzy clustering algorithm. The algorithm uses a graph representation of the data, and operates in an Back Propagation framework in which the graph centrality of an object in the graph is interpreted as a likelihood. Results of applying the algorithm to sentence clustering tasks demonstrate that the algorithm is suitable of identifying more clusters of related sentences, and that it is therefore of potential use in a variety of text mining tasks. Keywords— Sentence Clustering, Fuzzy clusters, Back Propagation, Page ranks, Membership values
منابع مشابه
Sentence Level Text Clustering using a Hierarchical Fuzzy Relational Clustering Algorithm
Clustering is the process of grouping or aggregating of data items. Sentence clustering mainly used in variety of applications such as classify and categorization of documents, automatic summary generation, organizing the documents, etc. In text processing, sentence clustering plays a vital role this is used in text mining activities. Size of the clusters may change from one cluster to another....
متن کاملClustering Sentence-Level Text Using a Novel Fuzzy Relational Clustering Algorithm
In comparison with hard clustering methods, in which a pattern belongs to a single cluster, fuzzy clustering algorithms allow patterns to belong to all clusters with differing degrees of membership. This is important in domains such as sentence clustering, since a sentence is likely to be related to more than one theme or topic present within a document or set of documents. However, because mos...
متن کاملSurvey on Clustering Algorithm for Sentence Level Text
Clustering is an extensively studied data mining problem in the text domains. The difficulty finds numerous applications in customer segmentation, classification, collaborative filtering, visualization, document organization, and indexing. In text mining, clustering the sentence is one of the processes and used within general text mining tasks. Several clustering methods and algorithms are used...
متن کاملOptimal Sentence Clustering Using An Innovative Hierarchical Fuzzy Clustering Algorithm
The role of data clustering is inevitable in many text processing activities .Many proceedings are going on in this area since it has wider applications. Sentence clustering is a challenging task when compared with other data clustering, because a sentence is able to represent same ideas in different ways. For E.g. some people see a glass as half empty and some others see half full. Due to this...
متن کاملProviding a Method to Identify Malicious Users in Electronic Banking System Using Fuzzy Clustering Techniques
Money-Laundering causes a higher prevalence of crime and reduces the desire tending to invest in productive activities. Also, it leads to weaken the integrity of financial markets and decrease government control over economic policy. Banks are able to prevent theft, fraud, money laundering conducted by customers through identification of their clients’ behavioral characteristics. This leads to ...
متن کامل